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1 Introduction 

One of the primary tasks of a computer vision system is to reconstruct from two- 
dimensional images certain three-dimensional properties of a scene such as motion, shape, 
and the spatial arrangment of objects in the scene. In monocular vision, the goal is to 
recover, from time-varying images, the relative motion between a viewer and the environ- 
ment as well as the structure of the environment. The structure of the scene is usually 
denned in terms of the relative depth of points on the visible part of the surface of the 
scene. 

An important issue in motion vision is whether the solution can be determined 
uniquely. Alternatively, one may ask if there exist situations that give rise to an am- 
biguity in the interpretation of three-dimensional motion and shape. 

Apparently, Hay [1966] was the first to report the observation that two planar surfaces 
each undergoing different rigid motions can produce the same instantaneous motion in 
the image plane. The same observation and the proof of the existence of at most two 
solutions in the case of planar surfaces has since been reported by Tsai et al. [1982], 
Waxman & Ullman [1985], Longuet-Higgins [1984], Maybank [1984], and Negahdaripour 
& Horn [1985]. 

In the case of curved surfaces, two types of approaches, based on local and global 
representation of three-dimensional surfaces, have been pursued. In the local approach, 
the surface is represented by its Taylor series expansion in some neighborhood of the 
fixation point, based on the assumption that the surface has continuous derivatives up 
to some order n in that region. In the global approach, no special model is assumed and 
the depth values are allowed to vary arbitrarily from one point to the next. 

Using a local second-order analysis, Longuet-Higgins & Prazdny [1980] show that 
three interpretations are possible for the motion parameters and the local structure of 
the surface of the scene (tangent plane orientation and surface curvature). Waxman et 
al. [1986] derive the special cases which give rise to the three-fold ambiguity observed 
by Longuet-Higgins & Prazdny. In addition, they show that other situations can give 
rise to a two-fold ambiguity (similar results were derived by Negahdaripour & Yuille 
[1986]). Negahdaripour [1986] shows that only certain hyperboloids of one sheet and 
circular cylinders can give rise to an image motion field with multiple interpretations. 
The ambiguity of hyperboloids of one sheet can be either two-fold or three-fold. The 
ambiguity associated with circular cylinders is two-fold and can be viewed as a degenerate 
case of the three-fold ambiguity of hyperboloids of one sheet. Negahdaripour [1986] also 
shows that most of the ambiguities observed by Waxman et al. are the shortcomings of 
a local second-order analysis of the motion field. 

The same problem has been addressed using a global analysis. Fang & Huang [1984] 
use the correspondence of nine image points in two frames to show that the motion 
parameters can be determined uniquely unless the points lie on a second-order surface 
passing through the viewing point. Tsai & Huang [1984 show that the correspondence of 
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seven image points is sufficient to recover motion uniquely unless the points lie on either 
two planes with one passing through the viewing point or a cone that passes through 
the viewing point. In an elegant derivation, Horn [1986b] shows that the class of curved 
surfaces that may give rise to the same motion field is restricted to certain hyperboloids 
of one sheet that are viewed from a point on their surface. He also shows that the special 
cases observed by Tsai & Huang are degenerate cases of his results. 

In this paper, we present some results related to the ambiguities in the interpretation 
of three-dimensional motion of curved surfaces. These results can be summarized as 
follows: 

(1) Only certain hyperboloids of one sheet or cylindrical surfaces that are viewed by an 
observer moving parallel to the image plane can give rise to an ambiguity in the 
interpretation of a perspective motion field. These ambiguities occur under two rare 
conditions and can be either two-fold or three-fold. In either case, the resulting motion 
field is quadratic. 

(2) When multiple solutions exist, the relationship among them can be derived in closed- 
form. 

(3) With a large field of view, it is generally possible to identify the correct solution by 
imposing the constraint that depth is positive over the image region onto which the 
surface projects. 

2 Motion Field 



We assume a viewer-centered coordinate system. The optical axis is chosen along the 
2-axis, and the image is formed on the plane z = 1; that is, without loss of generality, 
the focal length is chosen to be unity (Figure 1). Let R = [X,Y y Z] T be a point in the 
scene that projects onto the point r = [x, y, 1] T in the image. Assuming a perspective 
projection, we have 

Rz 

where Z = R • z is the distance of the point R from the viewer along the optical axis. 
Suppose the viewer moves with relative translational and rotational velocities t and u 
with respect to the scene. Then a point in the scene moves with respect to the viewer 
with velocity 

Rf = — R x u — t. 

The corresponding image point moves with velocity (Negahdaripour & Horn [1987]) 

r* = -z x (r x (r x w - =-rt)) ) 
\ v R • z 




Figure 1. Viewer-centered coordinate system. 



Note that r* • z = and, therefore, the third component of the 3D vector r* is always zero, 
as expected. The velocities of all image points taken collectively defines a two-dimensional 
vector field that we call the image motion field (Negahdaripour & Horn [1987]). This 
vector field has been referred to elsewhere as the optical flow field (see Horn [1986a] or 
Negahdaripour [1986] for the distinction between the motion field and optical flow). 

The motion field remains unchanged if the translational vector t and the depth values 
Z are multiplied by the same constant factor. Therefore, we can recover depth and the 
translational motion from image motion field only up to a scale factor; this has been 
referred to as the scale-factor ambiguity in motion vision. 



3 The Uniqueness Issue 

The motion field is a purely geometric concept. It is uniquely defined in terms of the 
observer motion and the scene structure. More precisely, once we specify the motion of 
the viewer as well as the structure of the scene, the motion field is unique as given by the 
earlier equation; that is, 

3D motion and structure — ► 2D image motion 

Ideally, the motion field over some region of the image can be used to recover the 
relative motion, w and t, as well as the structure of the scene (defined in terms of the 
depth values of the points on the surface of the scene, Z) up to a -< ale factor. The issue 
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we address is the uniqueness of the solution: 

2D image motion — > ' 3D motion and structure 

More precisely, the question we answer is: Given the motion field over some region of 
the image, when can we recover the motion parameters and the scene structure uniquely 
(up to the scale-factor ambiguity)? Note that the solution may be non-unique because 
the relationship between the two-dimensional image motion field and the underlying 
three-dimensional structure and motion is non-linear and, therefore, there need not be a 
one-to-one correspondence. Alternatively, we are interested to know what circumstances 
may give rise to an ambiguity in the interpretation of the motion field. In general, the 
problem of robust recovery of 3D structure and motion can be more complicated (than 
deriving the uniqueness results) since we cannot compute the motion field accurately 
enough from time-varying images. The inaccuracy in the estimate of the motion field 
obtained from noisy data may then introduce additional ambiguities in the interpretation 
of three-dimensional motion and shape (Ullman [1983], Jerian & Jain [1983], Adiv [1985]). 
In this paper, we restrict attention to the types of ambiguities that arise even when the 
exact motion field is used. 

4 Surface Representation 

One of the issues in deriving uniqueness results for curved surfaces is the choice of an ap- 
propriate representation of general three-dimensional surfaces. For example, the surfaces 
of a scene may be represented by the relationship Z = Z{X, Y), where R = [X, Y, Z] T is a 
point on the surface of the scene. This will be referred to as a global representation since 
we can define the whole surface in this form. There is no constraint on the relationship 
among the depth values of neighboring points and the surface need not have a particular 
structure in local regions; for example, it need not be smooth. The surfaces of most 
physical objects, however, possess some degree of "regularity" at least in local regions; 
the regularity or smoothness of a surface can be measured in terms of the continuity in 
the surface function and its derivatives. In other words, most surfaces have continuous 
derivatives up to some order n in local regions. Then we can represent a surface by its 
Taylor series expansion up to order n in a local region, assumed, for simplicity, to be 
around the fixation point: 

Z = Z + Z X X + Z Y Y+]-Z X xX 2 + Z XY XY + \z YY Y 2 + ...+ 

^Zx...xX n + ... + ^Z Y ... Y Y n + O(e). 
n! nl 

Alternatively, the surface may be represented by a Taylor series expansion in terms of 
the image coordinates. This is a more convenient representation when we deal with 
images. Finally, we may write the expansion of d = 1/Z (i liis is a measure related to 



the disparity function in stereopsis), instead of the expansion of Z, since the motion field 
equation depends on 1/Z (as a result of perspective projection). Therefore, the surface 
can be represented by the equation 

d = d + d x x + d y y+-d xx x 2 + d xy xy + -d yy y 2 + ...+ 

—d x ... x x n + ... + —d y ... y y n + 0(e). 

Ignoring the error term, this can be written more compactly using tensor notation, 

d = d + dpi + -dijTiTj + . . . H — jdij... r i r j •'• ■> 

where r t - = [x, y] T . Since depth can be recovered only up to a scale factor, we can set 
d = 1. Therefore, we can write 

1 , 1 , 

d = 1 + dpi + —dijrifj + . . . H rdij... r »' r i 

2 n\ 

This will be referred to as a local representation since it may be the true representation 
of the surface only in a local region near the fixation point. One justification for using 
a local representation is that we only need a finite number of parameters, namely, the 
coefficients of the Taylor series (rf,-, d,-y, . . . ) to represent the surface, where the number 
of parameters is related to the order of the Taylor series. We may not need to impose 
any restriction on n in addressing the uniqueness issue since we deal solely with a purely 
theoretical problem. The problem of robust recovery of motion and shape, however, is a 
separate issue. In practice, we cannot robustly determine the coefficients of the surface 
function beyond the linear terms due to noise in the data (Adiv [1985], Le Guilloux 
[1986]). Furthermore, the resulting non-linear problem is usually ill-conditioned. 

Using a local representation of curved surfaces, the motion vision problem reduces 
to estimating a finite number of unknowns, namely, the motion and surface parameters, 
from the image data. Similarly, the uniqueness issue translates into the following question: 
How many sets of motion/ surface parameters can give rise to the same motion field over 
the image region of interest? 

5 Ambiguities in the Interpretation of the Motion Field 

If we substitute the equation for d = 1/Z into the motion field equation, we arrive at the 
Taylor series representation of the motion field, rt — ', « , v, 0] T , in the image region under 
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consideration; that is, 

1 2 1 2 

u = u + u x x + u y +-u xx x + u xy xy + -u vy y + 

£ It 

-u xxx x z + -u xxy x 2 y + -u xyy xy 2 + -u y yyy 3 + ... 

1 2 1 2 

v = v + v x x + v y +-v xx x + v X yxy + -v yy y + 

It Jd 

~V XXX X Z + ~V xxy X 2 y + -V xyy Xy 2 + -Uyyyy 3 + • • • 

where the coefficients of the Taylor series expansion of the motion field are given by 

u o = - w y -*x v = w x - ty 

u x = t z — d x t x v x = —w z — tyd x 

Uy ^ Wg t x (ty Vy -— tg £y"y 

U xx = IVy -\- tg&x *'* x Qi XX Vxx = &ty(*xx 

U X y = W x t tgdy t x d X y Vxy == Wy + t z O, x *y™>xy 

Uyy = —2t x dyy Uyy = W x + tgdy — 2ty(lyy 

l^xxx =z 3tg(l xx t x d xxx Vxxx == lydxxx 

U XXy = 4lgd X y — t X d XX y Vxxy = *zd-XX ~ tyd XX y 

u ayy = *«%y — *xd X yy v xyy = **zdxy ~ *yjd X yy 

u yyy = ~^xd yy y v yy y = Zt z d yy — tyd yy y 



The question regarding the uniqueness of the solution can be rephrased as follows: 
Under what circumstances can we obtain the same set of motion field coefficients up to 
some order n for different sets of motion/surface parameters? 

Considering the motion field coefficients up to the second-order terms, Longuet- 
Higgins & Prazdny [1980] showed that three interpretations for the motion and local 
structure of the surface (tangent plane orientation and surface curvature) are generally 
possible. They arrived at this conclusion by reducing the problem of determining the 
motion and surface parameters to solving a cubic equation; however, they did not show 
when the cubic equation can possess three real solutions. Waxman et al. [1986] derived 
the conditions that give rise to the ambiguity observed by Longuet-Higgins & Prazdny 
(similar results were derived independently by Negahdaripour & Yuille [1986]). These 
conditions can be categorized as follows: 

(l) Three solutions are obtained when d x = d y = 0, the surface has a negative Gaussian 
curvature (d X xdyy—d 2 y < 0), and the mean-scaled curvature is unit > {^{d xx +dyy) = 1). 



(2) Two solutions are obtained if the surface has a non-positive Gaussian curvature 
(d xx d yy — d xy < 0), the mean-scaled curvature is unity {\{d xx + d yy ) = 1), and the 
translation along the line of sight vanishes (t z = 0). When d x /dy = t x /t y , the two 
solutions become equivalent. 

(3) Two solutions are obtained regardless of the sign of the Gaussian curvature when 
d x /dy = t x /ty, (this condition was referred to as the structure-motion coincidence). 
The two solutions degenerate to a unique solution when the translation is along the 
surface normal or when t z = 0. 

(4) In all other cases, the solution is unique. 

Longuet-Higgins & Prazdny claim that these ambiguities are usually only instantaneous, 
and can be resolved at the next time instant. Negahdaripour [1986] shows that an 
ambiguity in the interpretation of the motion field arises only in the case of some quadratic 
surfaces with a non-positive Gaussian curvature at the fixation point. More importantly, 
these surfaces have to be viewed by an observer translating perpendicular to the viewing 
direction. Interestingly, the restriction on the motion of the observer is peculiar to the 
local representation of curved surfaces. Furthermore, the resulting motion field is second- 
order; but this obviously does not imply that any second-order motion field is ambiguous. 
The ambiguity in the case of quadratic surfaces that are viewed by an observer moving 
parallel to the image plane (t 2 =0) is either 

(1) two/three-fold when the surface gradient vanishes (d x = d y = 0), the Gaussian cur- 
vature is zero/ negative (d xx d yy — d\ y < 0), and the mean-scaled curvature is unity 
{\{d xx + d yy ) = 1), or 

(2) two-fold when the Gaussian curvature is negative {d xx d yy — d\y < 0), the mean-scaled 
curvature is unity (|(rfza: + d yy ) = 1), and the surface normal, the optical axis, and 
one of the asymptotic lines of the quadratic surface are in the same plane; that is, 



d x ~ dx v + \My ~ ****** d x ~ d ** ~ v dx * ~ dxxd yy 

either — = or — = - . 

CLy ("XX **y ("XX 

When d x /dy = t x /t y , the ambiguity is resolved because the two solutions become 
identical. 

6 Surfaces That Give Rise to an Ambiguity 

We have given the conditions under which there may be an ambiguity in the interpretation 
of a given motion field resulting from the relative motion between an observer and a curved 
surface. Since the ambiguity is restricted to quadratic surfaces, we can ignore the higher 
order terms in the surface function given earlier. Therefore, an "ambiguous surface" is 
given by 

d = 1 + dpi + -dijurj, 



Motion of Planar Objects 



which can be written 



where 



d = lr T T>r, 
2 

(d>xx dzy "a; 
d X y dyy <ly 

d x d y 2 

If we multiply both sides of the equation for d = 1/Z by Z 2 , we obtain 

Z = -R T DR 
2 

or 

-z T R + -R r DR = 0, 

which is the equation of a quadratic surface passing through the origin (viewing point) . 
This, however, is an artifact resulting from the multiplication of both sides of the equation 
for d by Z 2 ; that is, by multiplying both sides of the equation for d by Z 2 , we have 
artificially made the origin (R = 0) a point on the surface. To be precise, we should 
define the surface by 

-z T R + 2R T DR = 0, R^O. 

This brings up an interesting issue since some previous results derived using a global rep- 
resentation of curved surfaces suggest that an "ambiguous surface" should pass through 
the viewing point (Fang &: Huang [1984], Tsai & Huang [1984], Horn [1986b]). The 
inconsistency in the interpretation of the results can be attributed to the difference be- 
tween the local and global representations of the surface and can be explained through 
the following example: Consider an observer on the inside surface of a cylinder viewing a 
point on the inside surface directly across from him/her (as before, a viewer-centered co- 
ordinate system is assumed) . In the global representation, we consider the whole surface, 
whereas in the local representation, we only consider some neighborhood of the fixation 
point. The viewing point is a point on the surface in the global representation. It will 
not be so in the local representation. The difficulty, however, arises when we consider 
the behavior of the motion field at the origin of the image plane. The question is: Which 
point projects onto the origin of the image plane, the viewing point or the point directly 
across? Consequently, is it the motion of the viewing point or the point across that 
we observe (or measure) in the image? Clearly, it is the latter. Whether we do or do 
not include the origin as a point on the surface seems to depend on the choice of the 
representation we use. 

The signs of the eigenvalues of the matrix D determine the type of the quadratic 
surface. When an ambiguity exists, the eigenvalues of D, in the ascending order, are 
given by (see the Appendix) 



A_ = 1 - \/l + p + q, A = 2, and A+ = 1 + \/l t-p+q, 



where 

~ — 'iy anu. y — u X y — u X x u yy 



p= d x + d y and q = d xy — d xx d s 



In the case of a three-fold ambiguity, we have p = and q > 0. The eigenvalues of D 
become 

A_ = 1 - y/l + q < 0, A = 2, and A+ = 1 + y/T+q > 2. 

Since the ambiguous surface has one negative and two positive eigenvalues, it has to be 
a hyperboloid of one sheet. When q = 0, that is, the Gaussian curvature vanishes, the 
ambiguity degenerates to a two-fold one. In this case, we have 

A_ = 0, A = 2, and A+ = 2. 

and the hyperboloid of one sheet degenerates to a circular cylinder. The other ambiguous 
situation arises when the optical axis, the surface normal, and an asymptotic line are in 
the same plane. In this case, p > and q > 0, and we obtain 

A_ = 1 - y/l + p + q < 0, A = 2, and A + = 1 + y/l + p + q > 2. 

Again, the surface is a hyperboloid of one sheet. Note that, in every case, we have 
A_ + A+ = A and, therefore, there is a constraint among the eigenvalues. We conclude 
that not every hyperboloid of one sheet or circular cylinder can give rise to an ambiguity. 

7 Relationship Among Multiple Solutions 

The conditions given above allow us to determine when we may expect an ambiguity in 
the interpretation of a motion field and the number of possible interpretations. Generally, 
we do not know the correct interpretation in advance. Therefore, we need to obtain all 
of the possible solutions before we can identify the correct one; it is often difficult, if not 
impossible, to obtain a robust closed-form solution in the case of curved surfaces. We may 
need to rely on some type of iterative method to recover motion from a set of non-linear 
equations. An iterative method, however, can converge to only one of up to three possible 
solutions (when multiple solutions exist) depending on the initial condition. Several runs 
of the iterative algorithm may be necessary before we obtain every possible solution; this 
is expensive computationally. Furthermore, in some cases, a solution may be hard to 
obtain if it has a small radius of convergence (this is important if it is the true solution). 
Finally, the true solution may not be an optimum one (in the least-squares sense) with 
noisy data. Therefore, it is not only important to know what circumstances can give rise 
to an ambiguity and the number of possible solutions, it is equally important to know 
the relationship between the true and spurious solutions. 

Suppose we have determined a motion/surface pair, {t,(v} and d = 1/Z, that is 
consistent with the data, where the surface is given by 

1 , 

d = 1 + diU + -dijurj. 
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Furthermore, suppose we know that the solution is not unique (that is, the solution we 
have obtained either fully or, due to noise, approximately satisfies the conditions for an 
ambiguous case). Then we can find another (two other) motion/surface pair(s), {t, w} 
and d, that is (are) consistent with the data, where the surface d is given by 

d = 1 + d^i + -dijrirj. 

Negahdaripour [1986] has derived the relationship among the multiple solutions in the 
two potentially ambiguous cases described earlier. 

7.1 Case One: Three-Fold Ambiguity of Hyperboloids of One Sheet 

There may be three interpretations if the surface of the scene is a hyperboloid of one 
sheet. The "surface of ambiguity" is characterized by the following conditions: 

(1) the surface gradient is zero (d x = d y = 0), 

(2) the Gaussian curvature is negative (d xz d yy — d\ y < 0), and 

(3) the mean-scaled curvature is unity (\{d xx + d yy ) = 1). 

The other solutions, in terms of the first solution, are given by 

t x = aty 

ty = (t X dy i 
d = l 



d x = 
where 



dy=0 



atyd xx )/(2a) 


<JJ X =W X - kdyyl(20Lty) 

%y — Wy — kd xx /(2t y ) 




0J Z = W e 

0>XX =Z ty(lxx/''y 

d X y = —(atxdxx + tydyy) J (2at y ) 

dyy = t x dyyl(aty) 


-d xy ± y/dly - 


~ ("xxU'yy ^, 
— J 1. _ f „.* + U 



a. = 

O-xx 



For each solution of a, we obtain one "spurious" solution from these equations. We 
can alternatively determine both spurious solutions using only one of the solutions for 
a. This requires that we derive another set of relationships for the dual solutions. These 
are given by 



tx = (at x + t y )dyy/(2a) 


u x = w x — ka/(at x + t y ) 


ty = (at x + t y )d xx /2 


Uy = Wy - k/(at x + ty) 


t z = 

d = l 


UJ Z = W z 

d xx = 2t y /(at x + t y ) 


i* = 


d xy = -{t x + at y )/(at x + ty) 


d y = 


dyy = 2at x /(at x + t y ) 



where 

k = -(at x + t y )(at x d xx - t y d yy ) f (2a). 
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To summarize, we can determine the two "spurious" solutions either (1) by substi- 
tuting the two values obtained for a into one of the two sets of relationships given above 
or (2) by substituting one of the two solutions of a into both sets of equations. When 
the Gaussian curvature is zero (with other conditions being the same), the surface degen- 
erates to a circular cylinder and the ambiguity reduces to a two-fold one since the two 
solutions for a become identical. 

7.2 Case Two: Two-Fold Ambiguity of Hyperboloids of One Sheet 

Certain hyperboloids of one sheet can give rise to a two-fold ambiguity. The surface of 
ambiguity is characterized by the following conditions: 

(1) the Gaussian curvature is negative (d xx d yy — d xy < 0), 

(2) the mean-scaled curvature is unity (\{d xx + d yy ) = 1), and 

(3) the surface normal, the optical axis, and one of the asymptotic lines of the quadratic 
surface are in the same plane; that is, 



d x ~ d *v + V dly ~ d xxd yy dx -d xy - Jd\ y - d xx d, 

either — = : or — 



yy 



Cl y dzx dy &xx 

The dual solution is given by 

t x = <*ty oj x = w x — kd yy /(2at y ) 

t y = (t x d yy + at y d xx ) I (2a) u y = w y — kd xx /(2t y ) 

t z = w z = w z + kd y /t y 

u> == A &XX = tyQ'XX/ty 

<lx = txdy/ty dxy = -(at X d X x+ t ydyy ) / (2ttt j,) 

dy = tydy/ty dyy = t X d yy / (at y ) 

where a = d x /d y and A; = (at y — t x )t y . When d x /d y = t x /t y , the ambiguity is resolved 
because the two solutions become identical. 

8 Imposing Depth-Positiveness Constraint to Resolve Ambiguity 

In many cases, an incorrect interpretation, when more than one exists, may be ruled out 
due to the fact that it violates physical constraints. One such constraint is that the depth 
value for a point that is projected onto the image should be positive; that is, a point can 
be seen only if it is in front of the camera (see also Horn [1986b]). 
Consider, once again, the surface 

1 , 
d=l + diu + -dijurj. 
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Since d = 1/Z, then d = implies that Z -+ oo. Therefore, the conic section 

1 + diu + -dijTitj = 

is the image of points at infinity. It is also the boundary between regions with positive 
and negative depth values. The region(s) with positive depth values can be the image of 
the surface under consideration whereas the regions with negative depth values cannot. 
A negative depth value corresponds to a point on a surface that either is behind the 
camera (or translates in the opposite direction). The point cannot belong to the same 
physical surface that is projected onto the region of the image with positive depth values. 
The regions with negative depth values are the image(s) of the background and/ or other 
objects in the scene. Therefore, the conic section is the boundary between the object 
under consideration and the background or other objects in the scene. We can use these 
boundaries to identify the correct solution, provided that the field of view is large enough 
so that the image includes some portion of these boundaries. 

It is more instructive to study an ambiguous situation. We consider the case that gives 
rise to a three-fold ambiguity since it is easier to derive the equations for the minimum 
size of the field of view. In this case, the surface is given by 

d = 1 + -d xx x 2 + d xy xy + -d - 2 



with the constraints 



d xx dyy - d xv < and -{d xx + d yy ) = 1. 



Suppose the image plane is circular with radius r (half-angle field of view is tan -1 (r) 
degrees) . In order to have some portion of the boundaries of the true surface within the 
field of view, we should have r > r c , where r c is the shortest distance from the origin of 
the image plane to a point on the conic section 

1 + -dxxx 2 + d xy xy + -d yy y 2 = 0. 

Ideally, we want a larger field of view so that we can have as much of the boundary in 
the image as possible. 

When d xy ^ 0, it can be shown that r c = \Jx\ + y 2 , where 



4- -2 

x c = it 4 / 5 , y c = mx c , 

V dyym 2 + 2d xy m + d xx 



and 

m 



dyy d zx _^ , dyy d xx . g ^ 

2d xy y 2d xy 
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Only one of the two signs gives a value of m that makes the solution for x c and y c real- 
valued (the proof follows easily from the geometry of the problem). When d xy — 0, then 
either d xx < or d yy < (because d xx d yy - d\ y = d yy d xx < 0). In this case, we instead 
have either 

(x c = 0, y c = J— J if d y 



dyy 



W<0 



or 



[y c =0,x c = J—) if eks < 0. 



We can derive similar expressions for r c , the shortest distance from the origin of the image 
plane to a point on the conic section of either spurious solution. Now if r c > r > r c , the 
image includes some portion of the boundaries of the true surface but no part of the 
boundaries of the spurious solution or if r > r c and r >r c , then the image should include 
some portion of the boundaries of both the true and spurious solutions. It is only when 
r < min(r c ,r' c ) that we cannot identify the true solution since then the depth values are 
positive everywhere in the image for every possible solution. 

Example: Consider a viewer moving with translational velocity t = [1,2, 0] T with re- 
spect to surface d defined by 

d = 1 + 0.5x 2 + lOxy + 0.5y 2 . 

Using the equations given earlier, the spurious solutions are: 

(1) An observer moving with translational and rotational velocities ti = [.445, —8.98, 0] 
and w\ = [—11.0, .550, 0] T with respect to surface d\ given by 

di = 1 + -.112x 2 - 2.17xy + l.lly 2 . 

(2) An observer moving with translational and rotational velocities t2 = [—19.5, .975, 0] T 
and u)% = [—1.03, 20.5,0] T with respect to surface di given by 

d 2 = 1 + +1.03x 2 - .461xy - .026y 2 . 

The boundaries of the three surfaces are shown in Figure 2. For each solution, the regions 
of negative depth values are shown by hatched lines. The resulting second-order motion 
field is given by 

(-1 - 0.5x 2 - lOzy - 0.5y 2 
-2 - x 2 - 20xy - y 2 

which is shown in Figure 3 (the image plane is a unit square; that is, the field of view is 
2 tan -1 (0.5) « 54 degrees). Note that the velocity vectors are all parallel emanating from 
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the focus of expansion at infinity in the region where the depth values are positive (the 
region including the bottom-left, the center, and the top-right of the image). They point 
in the opposite direction in the bottom-right and top-left of the image. This implies that, 
for points in these regions, either the motion is in the opposite direction (to the true one) 
or the depth values are negative. In either case, these regions cannot be the images of 
parts of the same object that is imaged into the region with positive depth values. 

The apices of the conic section of the true surface are located at ±(1/3,-1/3). If 
the field of view is larger than 2tan -1 (\/2/3) « 51 degrees, we can identify the correct 
solution by matching the boundaries of depth discontinuity in the image with the conic 
section of the true surface. 

The two spurious solutions involve a viewer rotating about an axis parallel to one of 
the asymptotic directions and translating parallel to the other asymptotic direction of 
the true surface. This is quite counter-intuitive since the motion field suggests that the 
underlying 3D motion is purely translational. 

9 Summary 

In this paper I have presented some results concerning the ambiguity in the interpretation 
of the motion of curved surfaces. These results suggest that only certain hyperboloids of 
one sheet or circular cylinders viewed by an observer moving parallel to the image can 
give rise to an ambiguity in the interpretation of the underlying motion. In the case of 
hyperboloids of one sheet, the ambiguity can be either two-fold or three-fold, whereas 
there can be at most two solutions in the case of circular cylinders. In either case, the 
resulting motion field is second-order. I have also given analytical expressions for the 
relationship among multiple solutions. In most cases, an ambiguity can be resolved by 
imposing the positive-depth constraint. 
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Regions of Negative Depth 
Values for Surface 1 

Regions of Negative Depth 
Values for Surface 2 
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Figure 2. The images of the boundaries of three hyperboloids of one sheet responsible for a three-fold 
ambiguity. The ambiguity in a motion field is usually restricted to small region of the image. For a 
sufficiently large field of view, the ambiguity can be resolved by matching the depth discontinuity boundaries 
with the conic section of the true surface. 
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Figure 3. A motion field with three rigid body motion interpretations. 
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Appendix 



The eigenvalues of 

D = 

are the solutions of the characteristic equation 

A 3 + a 2 A 2 + aiA + o = 0, 

where 

02 = —(2 + d xx + dyy), 

a x = 2{d xx + dyy) + [d xx dy V - d xy ) - (d x + d 2 y ), 

a = —2{d xx d yy - d xy ) + (d xx d y - 2d xy d x d y + d yy d x ). 

One of the conditions in the case of an ambiguity is that the mean-scaled curvature is 
unity; that is, \{d xx + d yy ) = 1. Therefore, we have 

a 2 = -4, 

«i = 4 + {d xx d yy - d\ y ) - (d 2 x + dj), 

a = —2(d xx dyy — d xy ) + (d xx d y — 2d xy d x d y + dy V d x ). 

Another condition is that either the surface gradient should vanish (d x = d y = 0) or the 
optical axis, the surface normal and one of the asymptotic lines of the surface are in the 
same plane. In either case, we obtain 

d xx d y — 2d xy d x d y + d yy d x = 0. 

Using this in the earlier equations and simplifying the results, we can show that the 
characteristic equation simplifies to 

(X-2)(X 2 -2X-(p + q))=0, 

where 

P=d x + d y and q — d xy - d xx d yy . 

Therefore, the eigenvalues of D, in the ascending order, are 

A_ = 1 - y/l + p + q, A = 2, and A + = 1 + \/l + p + q. 
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